Extending the Entity-based Coherence Model with Multiple Ranks
نویسندگان
چکیده
We extend the original entity-based coherence model (Barzilay and Lapata, 2008) by learning from more fine-grained coherence preferences in training data. We associate multiple ranks with the set of permutations originating from the same source document, as opposed to the original pairwise rankings. We also study the effect of the permutations used in training, and the effect of the coreference component used in entity extraction. With no additional manual annotations required, our extended model is able to outperform the original model on two tasks: sentence ordering and summary coherence rating.
منابع مشابه
An Optimal Approach to Local and Global Text Coherence Evaluation Combining Entity-based, Graph-based and Entropy-based Approaches
Text coherence evaluation becomes a vital and lovely task in Natural Language Processing subfields, such as text summarization, question answering, text generation and machine translation. Existing methods like entity-based and graph-based models are engaging with nouns and noun phrases change role in sequential sentences within short part of a text. They even have limitations in global coheren...
متن کاملExtending the Entity-grid Coherence Model to Semantically Related Entities
This paper reports on work in progress on extending the entity-based approach on measuring coherence (Barzilay & Lapata, 2005; Lapata & Barzilay, 2005) from coreference to semantic relatedness. We use a corpus of manually annotated German newspaper text (TüBa-D/Z) and aim at improving the performance by grouping related entities with the WikiRelate! API (Strube & Ponzetto, 2006).
متن کاملNon Secretory Multiple Myeloma With HCV Infection: A Rare Case Entity
Multiple Myeloma is a neoplasm of B cell lineage characterized by excessive proliferation of abnormal plasma cells. It is characterized by a clinical pentad of 1) anemia, 2) a monoclonal protein in the serum or the urine or both, 3) bone leisons and or bone pain, 4) hypercalcemia >11.5g/dl and 5) renal insufficiency. Non secretory multiple myeloma is a rare variant of the classic form of multi...
متن کاملA Novel Approach to Conditional Random Field-based Named Entity Recognition using Persian Specific Features
Named Entity Recognition is an information extraction technique that identifies name entities in a text. Three popular methods have been conventionally used namely: rule-based, machine-learning-based and hybrid of them to extract named entities from a text. Machine-learning-based methods have good performance in the Persian language if they are trained with good features. To get good performanc...
متن کاملThe Mediating Role of Sense of Coherence in the Relationship between Perceived Stress with Fatigue and Pain in Multiple Sclerosis Patients
Background and Objectives: Fatigue and pain are the common complications in multiple sclerosis patients, which is influenced by the patients’ psychology as well as stress. The current study aimed at investigating protective mediating role of sense of coherence in the relationship between perceived stress and fatigue/pain in Iranian MS patients. Methods: This cross-sectional study was carried ou...
متن کامل